Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants.

Authors

  • Aziz Belkadi
  • Alexandre Bolze
  • Yuval Itan
  • Aurélie Cobat
  • Quentin B Vincent
  • Alexander Antipenko
  • Lei Shang
  • Bertrand Boisson
  • Jean-Laurent Casanova
  • Laurent Abel

Abstract

We compared whole-exome sequencing (WES) and whole-genome sequencing (WGS) in six unrelated individuals. In the regions targeted by WES capture (81.5% of the consensus coding genome), the mean numbers of single-nucleotide variants (SNVs) and small insertions/deletions (indels) detected per sample were 84,192 and 13,325, respectively, for WES, and 84,968 and 12,702, respectively, for WGS. For both SNVs and indels, the distributions of coverage depth, genotype quality, and minor read ratio were more uniform for WGS than for WES. After filtering, a mean of 74,398 (95.3%) high-quality (HQ) SNVs and 9,033 (70.6%) HQ indels were called by both platforms. A mean of 105 coding HQ SNVs and 32 indels was identified exclusively by WES whereas 692 HQ SNVs and 105 indels were identified exclusively by WGS. We Sanger-sequenced a random selection of these exclusive variants. For SNVs, the proportion of false-positive variants was higher for WES (78%) than for WGS (17%). The estimated mean number of real coding SNVs (656 variants, ∼3% of all coding HQ SNVs) identified by WGS and missed by WES was greater than the number of SNVs identified by WES and missed by WGS (26 variants). For indels, the proportions of false-positive variants were similar for WES (44%) and WGS (46%). Finally, WES was not reliable for the detection of copy-number variations, almost all of which extended beyond the targeted regions. Although currently more expensive, WGS is more powerful than WES for detecting potential disease-causing mutations within WES regions, particularly those due to SNVs.
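
The comparison described above sorts each sample's calls into those made by both platforms and those made exclusively by one. A minimal sketch of that partitioning step follows, assuming each variant can be keyed by chromosome, position, reference allele, and alternate allele. The `Variant` type, the function names, and the toy calls are hypothetical illustrations, not the authors' pipeline or data, and the denominator used in `shared_fraction` (all distinct calls across both platforms) is an assumption rather than the paper's stated definition.

```python
from typing import Dict, Iterable, NamedTuple, Set


class Variant(NamedTuple):
    """Hypothetical variant key: two calls match only if all four fields match."""
    chrom: str
    pos: int
    ref: str
    alt: str


def partition_calls(wes: Iterable[Variant], wgs: Iterable[Variant]) -> Dict[str, Set[Variant]]:
    """Split two call sets into shared, WES-exclusive, and WGS-exclusive variants."""
    wes_set, wgs_set = set(wes), set(wgs)
    return {
        "shared": wes_set & wgs_set,
        "wes_only": wes_set - wgs_set,
        "wgs_only": wgs_set - wes_set,
    }


def shared_fraction(parts: Dict[str, Set[Variant]]) -> float:
    """Fraction of all distinct calls that both platforms made (assumed denominator)."""
    total = sum(len(group) for group in parts.values())
    return len(parts["shared"]) / total if total else 0.0


# Toy calls for illustration only; these are not data from the study.
wes_calls = [
    Variant("1", 100, "A", "G"),
    Variant("1", 200, "C", "T"),
    Variant("2", 50, "G", "GA"),
]
wgs_calls = [
    Variant("1", 100, "A", "G"),
    Variant("2", 50, "G", "GA"),
    Variant("3", 75, "T", "C"),
    Variant("3", 90, "A", "AT"),
]

parts = partition_calls(wes_calls, wgs_calls)
for name, group in parts.items():
    print(name, len(group))
print(shared_fraction(parts))
```

In practice the exclusive sets are the ones worth validating by an independent method such as Sanger sequencing, since, as the abstract reports, they can carry a high proportion of false positives.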

Similar resources

Whole Exome Sequencing Reveals a BSCL2 Mutation Causing Progressive Encephalopathy with Lipodystrophy (PELD) in an Iranian Pediatric Patient

Background: Progressive encephalopathy with or without lipodystrophy is a rare autosomal recessive childhood-onset seipin-associated neurodegenerative syndrome, leading to developmental regression of motor and cognitive skills. In this study, we introduce a patient with developmental regression and autism. The causative mutation was found by exome sequencing. Methods: The proband showed a gener...

Whole Exome Sequencing Reveals a XPNPEP3 Novel Mutation Causing Nephronophthisis in a Pediatric Patient

Background: Nephronophthisis (NPHP) is a progressive tubulointerstitial kidney condition with an autosomal recessive (AR) inheritance pattern. To date, more than 20 genes have been implicated in NPHP, with NPHP1 the first to be identified. X-prolyl aminopeptidase 3 (XPNPEP3) mutation is associated with NPHP-like 1 nephropathy and late-onset NPHP. Methods: The proband (index patient) had polyuria, polyd...

Editorial: The Post-Exome Era

The Iranian Rehabilitation Journal (IRJ) invites research papers on the genetic basis of single gene and complex disorders. This vastly dynamic branch of science will complement the multidisciplinary wealth of expertise in the fields of social welfare and rehabilitation. The past few years have witnessed outstanding research projects on the genetic causes of numerous debilitating disorders, suc...

Semantic prioritization of novel causative genomic variants

Discriminating the causative disease variant(s) for individuals with inherited or de novo mutations presents one of the main challenges faced by the clinical genetics community today. Computational approaches for variant prioritization include machine learning methods utilizing a large number of features, including molecular information, interaction networks, or phenotypes. Here, we demonstrate...

Approaches to the detection of recessive effects using next generation sequencing data from outbred populations

Conventional methods to analyze genome-wide association studies and whole exome or whole genome sequencing studies would be prone to overlook variants which might exert a recessive effect on risk of disease, either as homozygotes or compound heterozygotes. It is plausible that such effects may be common even in outbred populations. An approach is described which is based on identifying a set of...

Journal:
  • Proceedings of the National Academy of Sciences of the United States of America

Volume 112, Issue 17

Pages: -

Publication date: 2015